Performance Improvement Algorithms in Big Data Analysis

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MapReduce Algorithms for Big Data Analysis

There is a growing trend of applications that should handle big data. However, analyzing big data is a very challenging problem today. For such applications, the MapReduce framework has recently attracted a lot of attention. Google’s MapReduce or its open-source equivalent Hadoop is a powerful tool for building such applications. In this tutorial, we will introduce the MapReduce framework based...

متن کامل

Bringing High Performance Computing to Big Data Algorithms

Many ideas of High Performance Computing are applicable to Big Data problems. The more so now, that hybrid, GPU computing gains traction in mainstream computing applications. This work discusses the differences between the High Performance Computing software stack and the Big Data software stack and then focuses on two popular computing workloads, the Alternating Least Squares algorithm and the...

متن کامل

Scaling Polygon Adjacency Algorithms to Big Data Geospatial Analysis

Adjacency and neighbor structures play an essential role in many spatial analytical tasks. The computation of adjacenecy structures is nontrivial and can form a significant processing bottleneck as the total number of observations increases. We quantify the performance of synthetic and real world binary, first-order, adjacency algorithms and offer a solution that leverages Python’s high perform...

متن کامل

Classification Algorithms for Big Data Analysis, a Map Reduce Approach

Since many years ago, the scientific community is concerned about how to increase the accuracy of different classification methods, and major achievements have been made so far. Besides this issue, the increasing amount of data that is being generated every day by remote sensors raises more challenges to be overcome. In this work, a tool within the scope of InterIMAGE Cloud Platform (ICP), whic...

متن کامل

Early Adoption: High-Performance Computing for Big Data Introducing parallel programming and big data in the core algorithms curriculum

Proficiency in high-performance computing (HPC) is today an essential skill for students in any computer science program. While traditional curricula provide several courses related to parallel programming, we are increasingly including parallel computing topics in our mandatory undergraduate and graduate algorithms classes. We briefly review our recent activities in this direction and outline ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Procedia Computer Science

سال: 2020

ISSN: 1877-0509

DOI: 10.1016/j.procs.2020.11.040